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ABSTRACT 


In  this  thesis,  a  new  fictitious  play  (FP)  procedure  is  presented  to  solve  two- 
person  zero-sum  (TPZS)  Blotto  games.  The  FP  solution  procedure  solves  TPZS  games  by 
assuming  that  the  two  players  take  turns  selecting  optimal  responses  to  the  opponent’s 
strategy  observed  so  far.  It  is  known  that  FP  converges  to  an  optimal  solution,  and  it  may 
be  the  only  realistic  approach  to  solve  large  games.  The  algorithm  uses  dynamic 
programming  (DP)  to  solve  FP  subproblems.  Efficiency  is  obtained  by  limiting  the 
growth  of  the  DP  state  space. 

Blotto  games  are  frequently  used  to  solve  simple  missile  defense  problems.  While 
it  may  be  unlikely  that  the  models  presented  in  this  paper  can  be  used  directly  to  solve 
realistic  offense  and  defense  problems,  it  is  hoped  that  they  will  provide  insight  into  the 
basic  structure  of  optimal  and  near-optimal  solutions  to  these  important,  large  games,  and 
provide  a  foundation  for  solution  of  more  realistic,  and  more  complex,  problems. 
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EXECUTIVE  SUMMARY 


Fictitious  play  (FP),  first  introduced  by  Brown  and  Robinson  (1951),  is  an 
iterative  procedure  used  to  approximate  solutions  to  two-person  zero-sum  (TPZS)  games. 
At  each  iteration  of  FP,  each  player  chooses  a  pure  strategy  that  is  a  best  reply  to  the 
mixed  strategy  represented  by  the  aggregation  of  all  of  other  player’s  pure  strategies 
played  so  far,  assuming  they  will  be  chosen  based  on  the  empirical  probability 
distribution  induced  by  their  historical  frequency  in  all  previous  iterations.  Fictitious  play 
can  be  thought  of  as  mimicking  the  behavior  of  players  learning  from  their  opponents. 

The  purpose  of  this  thesis  is  to  investigate  the  use  of  fictitious  play  in  the  solution 
of  Blotto  games  and  their  generalizations.  In  Blotto  games,  opponents  each  allocate  a 
limited  number  of  forces  to  a  specified  number  of  areas.  Payoffs  in  each  area  accrue  to 
the  players  based  on  the  number  of  forces  assigned  to  each  area.  The  main  application  of 
Blotto  games  has  been  the  analysis  of  missiles  attack  and  defense  problems. 

In  this  thesis  a  new  fictitious  play  (FP)  procedure  is  presented  to  solve  two-person 
zero-sum  (TPZS)  Blotto  games.  The  FP  solution  procedure  solves  TPZS  games  by 
assuming  that  the  two  players  take  turns  selecting  optimal  responses  to  the  opponent’s 
strategy  observed  so  far.  It  is  known  that  FP  converges  to  an  optimal  solution,  and  it  may 
be  the  only  realistic  approach  to  solve  large  games.  The  algorithm  we  develop  uses 
dynamic  programming  (DP)  to  solve  the  FP  subproblems.  Efficiency  is  obtained  by 
limiting  the  growth  of  the  DP  state  space.  We  derive  the  dynamic  programming 
recurrence  relation  for  solving  Blotto  games  with  a  general  payoff  function  using 
fictitious  play.  The  recurrence  can  be  solved  without  explicitly  keeping  track  of  every 
attack  or  defense  played;  rather,  the  information  required  is  simply  the  number  of  times  a 
given  force  level  (number  of  attackers  or  defenders)  has  been  used  in  each  area,  over  all 
attacks  and  defenses  seen  so  far.  Although  our  experiments  considered  one  type  of 
attacker  and  one  type  of  defender,  we  indicate  how  to  generalize  this  procedure  to  cases 
with  more  than  one  type  of  attacker  or  defender  (or  both). 


During  this  study,  we  identified  other  topics  for  further  investigations.  The  first  is 
to  investigate  generalizations  of  Blotto  games  in  which  defenders  can  be  placed  in  such  a 
way  as  to  defend  multiple  areas  at  once.  This  is  closer  to  the  real  situation  with  missile 
defense.  The  second  is  to  explore  the  issue  of  playability  in  the  ILP  formulations.  Our 
proposed  playability  constraint  is  currently  too  restrictive;  we  have  provided  examples  in 
which  the  optimal  solution  to  the  ILP,  with  the  playability  constraint,  is  not  equal  to  the 
value  of  the  game.  Further  research  should  explore  less  restrictive,  alternate  formulations 
of  playability  constraints.  It  is  possible  (although  unlikely)  that  less  restrictive  playability 
constraints  would  also  yield  more  efficiently  solvable  ILP,  making  that  approach 
competitive  with  the  DP-based  procedure  for  larger  problems. 

While  it  is  unlikely  that  the  models  presented  in  this  paper  can  be  used  directly  to 
solve  realistic  offense  and  defense  problems,  it  is  hoped  that  they  will  provide  insight  into 
the  basic  structure  of  optimal  and  near  optimal  solutions  to  these  important,  large  games. 


xiv 


I.  INTRODUCTION 


Fictitious  play  (FP),  first  introduced  by  Brown  and  Robinson  (1951),  is  an 
iterative  procedure  used  to  approximate  solutions  to  two-person  zero-sum  (TPZS)  games. 
At  each  iteration  of  FP,  each  player  chooses  a  pure  strategy  that  is  a  best  reply  to  the 
mixed  strategy  represented  by  the  aggregation  of  all  of  other  player’s  pure  strategies 
played  so  far,  assuming  they  will  be  chosen  based  on  the  empirical  probability 
distribution  induced  by  their  historical  frequency  in  all  previous  iterations.  Fictitious  play 
can  be  thought  of  as  mimicking  the  behavior  of  players  learning  from  their  opponents. 

The  purpose  of  this  thesis  is  to  investigate  the  use  of  fictitious  play  in  the  solution 
of  Blotto  games  and  their  generalizations.  In  Blotto  games,  opponents  each  allocate  a 
limited  number  of  forces  to  a  specified  number  of  areas.  Payoffs  in  each  area  accrue  to 
the  players  based  on  the  number  of  forces  assigned  to  each  area.  The  main  application  of 
Blotto  games  has  been  the  analysis  of  missile  attack  and  defense  problems. 

This  thesis  is  organized  as  follows:  in  Chapter  II,  we  introduce  TPZS  games, 
Blotto  games  and  the  solution  procedure.  In  Chapter  III,  we  solve  various  versions  of  the 
problem  using  FP.  Chapter  IV  provides  conclusions  and  suggests  further  work. 
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II.  TWO  PERSON  ZERO-SUM  GAMES 


A.  DEFINITION 

A  two-person  zero-sum  (TPZS)  game  (von  Neumann  and  Morgenstern,  1944)  is  a 
situation  where  there  are  two  players  having  directly  opposite  interests.  In  a  TPZS  game, 
player  1  (also  called  X,  the  row  player,  or  the  maximizer)  has  m  pure  strategies  and  player 
2(7,  column  player,  minimizer)  has  n  pure  strategies.  A  player  can  commit  to  playing  a 
pure  strategy,  or,  by  randomizing  his  choice  among  several  pure  strategies,  he  can 
employ  a  mixed  strategy.  A  mixed  strategy  is  represented  by  a  vector  of  probabilities  of 
choosing  each  pure  strategy.  For  player  1,  we  write  this  vector  as: 

x  =  (x1,...,xm)T . 

Because  x  is  a  vector  of  probabilities,  we  have  the  restrictions  that 

m 

Za  =1 

z=l 

and 


x,.  >  0,  i  =  . 

Similar  notation  and  restrictions  are  used  for  player  2,  whose  mixed  strategy  is  written  as: 

y  =  (yx,...,yn)T . 

In  a  TPZS  game,  each  player  chooses  a  strategy  (pure  or  mixed),  unknown  to  the  other, 
and  both  strategies  are  revealed  simultaneously.  The  result  of  the  game  depends  on  the 
strategy  used  by  each  player.  If  X  and  7  choose  their  ih  and  jth  pure  strategies, 
respectively,  then  the  result  of  game,  denoted  aip  represents  the  amount  that  7  has  to  pay 
X.  Equivalently,  the  payoffs  to  X  and  7  are  a,j  and  -  aip  respectively.  Note  that  the  sum  of 
the  two  payoffs  is  zero,  which  explains  the  name  of  the  game.  If  two  players  employ 
mixed  strategies  x  and  y  (and  a  pure  strategy  is  just  a  special  case  of  a  mixed  strategy), 
then  the  payoff  to  player  1  is: 
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i= i  ;=i 

which  can  be  seen  as  the  expected  payoff  among  all  of  the  pure  strategies  represented  by 
x  and  y. 

Therefore,  a  TPZS  game  is  completely  defined  when  the  payoff  for  each  pair  of  A 
and  Y  pure  strategies  is  detennined.  These  payoffs  can  be  summarized  in  an  in xn  matrix, 
generally  referred  to  as  a  payoff  matrix,  i.e. 

a\  I  ai2  a\n 

A  _  a2l  a22  •  •  •  a2n 

_fm\  ^  m2  d mn  _ 

and  the  payoff  to  player  1  is  then  xT  Ay . 

In  playing  the  game,  both  players  are  assumed  to  choose  a  strategy  that  achieves 
the  most  favorable  outcome.  This  means  that  X  would  choose  the  strategy  that 
maximizes  xT  Ay  over  all  choices  of  y.  On  the  other  hand,  Y  would  choose  the  strategy 
that  minimizes  xT  Ay  over  all  choices  of  x. 

A  TPZS  game  has  an  equilibrium  point  when  each  player  can  guarantee  an 
optimal  result  by  always  choosing  a  single  pure  strategy.  When  equilibrium  cannot  be 
achieved,  players  must  use  mixed  strategies  to  optimize  the  value  of  the  game. 

T o  choose  the  best  randomized  strategy,  X  must  find  the  x  =  {xx , . . . ,  xm  )  to 

m  m 

max{min[^\x,a() .  ]:  ^jc,.  =1,  jc.  >  0,  i  =  l,...,m}. 

x  j  i=i  i=i 

Similarly,  Y  must  find  the  v  =  (y, , . . . ,  yn )  to 

n  n 

min  { max [ ^ ai}y }  =1»  Yj  -  °>  j  =  h-M  ■ 

y  '  J= 1  7=1 

Let  x*  and  y*  denote  the  optimal  strategies  for X  and  Y.  Then,  v*  =  ( x*)TAy *  is  the  value 
of  the  game.  One  of  the  central  results  of  game  theory  states  (Winston,  1991)  that: 
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v*  =  max{mm[^x;a!}.  ] :  ]Tx(.  =  1,  xi  >  0,  i  =  1 . m}  ,  and 

*  ]  i= i  i=i 

A2  W 

v*  =  min  { max  [£  a.. =  1,  y}  >0,  j  =  1, ...,«}  . 

7  '  y=i  y=i 

B.  LINEAR  PROGRAMMING 

When  the  payoff  matrix  is  specified  and  it  is  not  too  large,  linear  programming 
(Winston,  1991)  can  be  used  to  find  the  optimal  mixed  strategies  and  the  value  of  the 
game.  For  the  maximizer  (player  1),  the  problem  is  to  find  the  mixed  strategy 

n 

x  =  (Xj , . . . ,  xm )  which  maximizes  min  V  x;a..  .  That  is, 

LP 1 :  max  v 

m 

subject  to  -v  >  0,  j  =  1 

i= 1 
m 

^  x(.  =  1,  and  x,  =  1, ...,  m . 

i= i 

Similarly,  the  minim izer  (player  2)  must  solve  LP  2: 

LP  2  :  min  w 

n 

subject  to  y] y.a.y  -  w<  0 ,  i  =  1, ..., m 

7=1 

n 

Z^=i,  and 

7=1 

It  is  easy  to  show  that  problems  LP1  and  LP2  are  duals  of  each  other.  Moreover,  if 
(v*  x*)  and  ( vv *,  v *)  are  optimal  to  problems  LP1  and  LP2,  respectively,  then  v*=vv*. 

C.  FICTITIOUS  PLAY  (FP):  BROWN-ROBINSON  METHOD 

Fictitious  play  (FP)  was  introduced  by  Brown  and  Robinson  (1951).  It  is  an 
iterative  solution  procedure;  in  each  iteration,  players  choose  pure  strategies  that  are  the 
best  response  to  the  empirical  mix  of  their  opponents’  pure  strategies  seen  so  far. 

The  FP  procedure  implemented  here  begins  at  iteration  1  with  player  1  selecting 
that  row  maximizing  the  minimum  row  value,  and  player  2  selecting  that  column 
minimizing  the  maximum  column  value.  Denote  the  players’  pure  strategies  at  iteration  1 
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as  x(1)  and y(l).  These  are  vectors  of  all  zeros  except  for  a  1  at  the  selected  row  or  column 
locations.  At  iteration  2,  player  1  selects  pure  strategy  x(1),  which  is  the  best  row  response 
to  v(1);  and  player  2  selects  pure  strategy  y(2),  which  is  the  best  column  response  to  x(1). 
And  for  general  iteration  k  >  2,  player  1  selects  the  pure  strategy  x(k),  which  is  the  best 
row  response  to 


-a-D 

y 


k  - 


k- 1 

TX 

1  n= 1 


y 


(p) 


and 


player  2  selects  pure  strategy  v(k>,  which  is  the  best  column  response  to 


— (A— 1) 

x 


1 


k- 1 


k- 1 


P= 1 


For  computational  purposes, 


— (A+l) 

x 


is  conveniently  updated  from 


-(*) 

X 


and  xa+1) 


as  follows: 


-(t+i) 

x 


f  k  \ 

V  k  +  1  y 


~(k) 

X 


<k  +  ly 


x(k+1) 


And  similarly  for  player  2, 


-(*+i) 

y 


k  + 1 


-(k) 

■y  + 


f  l 

VA:  +  1 


■y 


(k+i) 


— (k)  — (k) 

Any  limit  points  of  the  sequences  {  x  }  and  {  y  }  are  optimal  mixed  strategy 
solutions  to  the  game.  Also  upper  and  lower  bounds  on  the  value  of  the  game,  v*,  are 
determined  at  each  game  play.  Specifically,  at  game  iteration  k, 


v_tS(xC>-")'Aym<v<(xmyAyM 


=  Vk  . 


and  both  vk  and  v*  converge  to  v  *,  but  not  necessarily  monotonically  (Eagle  and 
Washburn,  1991). 
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D.  BLOTTO  GAMES 


1.  Definition 

In  a  Blotto  game,  there  are  n>\  targets,  or  areas,  for  player  1  (the  attacker)  to 
attack  and  player  2  (the  defender)  to  defend.  The  attacker  chooses  a  vector  of  allocations 
x,  where  Xk  is  the  number  of  attacking  units  assigned  to  area  k,  and  player  1  has /  attackers 

n 

to  distribute,  resulting  in  the  constraint  ^  xk  <  f  .  The  defender  chooses  a  vector  of 

k= 1 

n 

allocations  y  subject  to  ^ yk  <  g ,  where  y k  is  the  number  defenders  assigned  by  player  2 

k= 1 


n 

to  area  k.  The  payoff  is  'Y_iA(xk,yk)  (See  Washburn,  1994,  pp. 107-1 11  for  a  more 


k= 1 


complete  discussion).  All  allocations  are  required  to  be  nonnegative,  and  in  a  discrete 
Blotto  game  they  are  also  required  to  be  integers.  The  number  of  pure  strategies  for 


player  1  is 


77+/-1 

.  /  . 


and,  for  player  2. 


n  +  g- 1 


both  of  which  grow  too  fast  to  allow 


v  S  9 

complete  enumeration  in  even  moderately  sized  games.  Figure  1  displays  a  typical 
increase  in  the  number  of  pure  strategies  for  player  1  as/ or  n  increase. 


Figure  1.  Number  of  pure  strategies  for  player  1,  for/=  1 0  1  <n<  1 00,  and  for  n=  10, 
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1</M00. 


2.  Playability 

In  Blotto  games,  it  is  sometimes  convenient  to  represent  a  mixed  strategy  with  the 
marginal  distributions  of  the  random  vector  X=(Xj, ....  Xn),  where  Xj  is  the  random 
variable  representing  the  number  of  attackers  assigned  to  area  i.  Marginal  distributions 

satisfying  ^Xk  =  f  are  playable  for  the  attacker.  Similarly,  the  marginals  for  the  random 

k 

variable  Y=  (Yj, Yn)  are  playable  for  the  defender  if  ^7^  =  g  .  However,  the  typical 

k 

approach  is  to  relax  these  restrictions  and  simply  require  that  £E(Xt)  =  f  and 

k 

£E(K)  =  g. 

k 

3.  ILP  Formulation  of  Blotto  Games 

Washburn  (1994)  presents  the  LP  formulation  of  Blotto  games  using  the  marginal 
distributions.  However,  those  formulations  do  not  guarantee  playability  (although  the 
discussions  of  those  formulations  claim  that  playability  is  not  an  issue  for  large 
problems).  We  modify  the  formulation  in  Washburn  (1994)  by  explicitly  adding 
constraints  that  are  sufficient  to  enforce  playability.  The  cost  of  such  constraints  is 
twofold:  (1)  they  are  sufficient,  but  not  necessary  conditions  for  playability,  so  the 
solutions  we  obtain  are  potentially  suboptimal;  and  (2)  they  introduce  integer  variables, 
rendering  integer  linear  programming  (ILP)  fonnulations. 
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ILP  1  solves  a  Blotto  defense  game  for  the  defender’s  marginals  ( v, , . . . ,  yg ) , 
where  v,  represents  the  probability  that  j  defenders  are  used  in  any  given  area.  Therefore, 

'Ll  ■  y,  =  EVk) . 

k 

ILP  1 :  min  v  =  nc  +  df 

y,  c,  d 

g 

s.t  ^  A(i,j)  ■  y j  <  c  +  d  -i;  i  =  0,l...,f 

j= o 

g 

2>y  ■  J-s1  n 

j= 0 

2>;  =  1 

j= 0 

ycountj  =  n  ■  v7 

yy>0,  for  ally  =  0,1, ...,g 

ycountj  e  {0,...,g}  and  d  >  0 

ILP  2  solves  a  Blotto  attack  game  for  the  attacker’s  marginals  (xx,...,xf) .  The 
integer  restrictions  ox  ycountj  and  xcounti  are  used  to  require  playability. 

ILP  2:  max  v  =  na  -  bg 

x,  a,  b 

/ 

s.t  YjA(i,j)-xi  >  ct-b-j;  y'  =  0,l...,g 

i= 1 

Yjx:-i<  fin 

i= 1 

=1 

1=1 

xcounti  =  n  ■  x; 

x,.  >  0,  forall /' =  0,1,...,/ 

xcountf  e  {0, •••,/}  and  h>  0 

These  playability  constraints  are  too  restrictive;  they  are  sufficient  to  enforce 
playability  but  they  are  provably  not  necessary. 
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4.  New  Fictitious  Play  Procedure 

We  derive  the  dynamic  programming  recurrence  relation  for  solving  Blotto  games 
with  FP,  using  a  general  payoff  function  (x,  v) ,  which  is  the  amount  player  2  pays  to 

player  1  when  player  1  allocates  x  units  to  area  k  and  player  2  allocates  y  units  to  area  k. 
The  total  payoff  is  obtained  by  summing  the  rewards  over  the  n  areas.  The  recurrence 
can  be  solved  without  explicitly  keeping  track  of  every  attack  or  defense  played;  rather, 
the  infonnation  required  is  simply  the  number  of  times  a  given  force  level  (number  of 
attackers  or  defenders)  has  been  used  in  each  area,  over  all  attacks  and  defenses  seen  so 
far. 


We  first  consider  the  defender’s  problem  at  FP  iteration  K,  which  is  to  allocate  g 
defenders  over  n  cities  to  minimize  the  expected  payoff,  given  that  K  attacks  have  been 
observed  so  far.  Each  attack  can  be  represented  by  a  column  vector  ak  =  (ak akn  )T , 

k  =  \,...,K  .  We  define  the  values^  =  f  ,  where  /  t  ^  represents  the  indicator 


variable  for  the  event,  “the  kth  attack  used  j  attackers  in  area  Therefore,  represents 

the  number  of  times  exactly  j  attackers  have  been  used  against  area  i.  We  first  determine 
the  value  of  placing  g  defenders  optimally  in  area  n.  Then  we  define  a  recurrence 
relation  on  a  value  function  vt(p)  that  represents  the  expected  payoff  of  placing  p 

defenders  optimally  in  areas  i,  z+1 , . . . ,  n.  ThenVj(g)  is  the  solution  to  the  original 
problem.  The  boundary  condition  is  given  by 


1 


'n(v)  =  —YJr„PA„(P,q),  q  =  0,...,g 


K  pTo 


(1) 


which  is  the  total  expected  payoff  when  the  defender  uses  q  defenders  in  area  n.  This 
states  that  the  optimal  defender  strategy  when  only  area  n  remains  is  to  allocate  all  q 
remaining  defenders  to  that  area.  The  recurrence  for  i  e  (1,. . .  ,n  -1}  is: 


K  i=» . 1 i 


Ai(p,j)  +  vM(q-j)\ 


(2) 
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This  is  the  expected  payoff  in  area  i  plus  the  expected  payoff  generated  by  placing 
the  remaining  (q-j)  defenders  optimally  in  areas  n.  The  optimal  defender 

allocation  to  area  i  is  the  value  of  j  minimizing  equation  (2). 

Similarly,  for  the  attacker’s  problem,  we  assume  that  K  defenses  have  been 
observed  so  far,  where  the  kth  defense  is dk  =  {dk  ,dk_,...,dk)T .  The  attacker  wishes  to 

allocate/ forces  over  the  n  areas  to  maximize  the  expected  payoff.  Let  s /  =  be 

the  number  of  times  j  defenders  are  placed  in  area  /'.  Define  wjp)  as  the  maximum 
possible  expected  payoff  in  areas  The  boundary  conditions  are: 

w,Xp)  =  -^Tjs"  A„(p,q),  p  =  0,...,f,  (3) 

&  <7=0 

and  the  recurrence  is  given  by: 

W,  ip)  =  77  max  |  ^ sf  4 (./,  q )  +  wM (p  -  j)  \  .  (4) 

K  J=°’  ’P  J 

The  optimal  solution  for  the  attacker  is  represented  by  wt  (/ )  and  the  corresponding 
decisions  j  maximizing  (4)  for  each  area. 

Blotto  games  can  be  extended  immediately  to  the  case  where  the  attacker 
possesses  different  numbers  of,  say,  two  types  of  attacking  units,  /  and  /,  and  the 
defender  also  has  a  supply  of,  say,  two  types  of  defending  unit,  gi  and  g2.  The  payoff 
function  now  depends  on  the  number  of  attackers  and  defenders  of  each  type  allocated  to 
each  area:  4  ( p{,  p2,q^q2)  .  If  we  define  r/' Jl  as  the  number  of  times  /  attackers  of  type 

1  and  ji  of  type  2  have  been  used  in  area  i,  and  similarly  for  sj  ’h  ,  then  our  value 
functions  are  two-dimensional,  with  boundary  conditions 

l  f  h 

A  Pi=0p2=0 

and 
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(6) 


l  Si  g  2 

Wn(Pl  ,  Pi)  =  ~  Z  Z  Sn"qi  An  (A  >  Pi  >  A  ><h)  > 

q1=0  q2=0 


and  recurrences 


vMv<k)  =  —  min 

K  Ji=0’  -’<h 

j2=0,...,q2 


J 1  J  2 

ZZ  f  A  4 (/?, ,  p2 ,  j\ ,j2)  +  vM  (ql  -  j\ , q2  -  j2  ) 

p,=0p2=0 


(V) 


and 


wSP^Pi)  =  ]t  max 

K  h  =0,...,A 

j2=0,...,p2 


?1.?2 


4  O'i  ,  j2  ,ql  ,q2 )  +  wM  (p,  -  j\  ,p2~j2)\ 


(8) 


Clearly,  the  size  of  the  static  space  grows  with  the  product  of  the  number  of  each  type  of 
attacker  or  defender.  It  is  still  manageable  with  just  a  few  types  of  attacker  or  defender. 
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III.  DATA  ANALYSIS  AND  RESULTS 


A.  MODEL  IMPLEMENATION 

The  FP  model  is  implemented  in  MATLAB  (Version  6.5).  The  ILP  solution 
procedure  is  implemented  in  GAMS  (Revision  135,  XA  solver).  Computations  are  done 
on  a  1.5  GHz  Intel  Centrino-based  laptop  computer  with  512  MB  of  RAM.  All  computer 
code  appears  in  Appendices  A  and  B. 


B.  NUMERICAL  RESULTS 

1.  Rate  of  Convergence  of  FP 

Define  gap(k)  to  be  the  difference  between  the  upper  and  lower  bounds  on  the 
value  of  the  game  at  FP  iteration  k.  Consistent  with  earlier  FP  studies  (Washburn,  2001), 
we  find  that  the  FP  gap  plotted  against  number  of  iterations  is  approximately 
asymptotically  linear  on  a  log-log  plot.  That  is,  for  large  enough  k, 

gap(k)  «  ,  or 

\og{gap{k))  a  log(a)  -  b  log  k , 

where  k  is  number  of  iterations  and  a  and  b  are  fitted  constants.  Limited  numerical 
experimentation  suggests  that  using  a  least  square  fit  and  dropping  first  100  iterations  the 
intercept  (log(o))  increases  with  increasing  /  or  g,  and  the  slope  (- b )  increases  (to 
approximately  -1/2)  with  increasing  n.  These  observations  are  illustrated  in  Figures  2,  3 
and  4. 
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Case 

f  g.  n 

#  pure  strategies 

Slope 

Intercept 

Final  Gap 
(UB-LB) 

Attacker 

Defender 

At 

50,  200,  10 

12565671261 

1.76081E+15 

-0.48374 

1.8887 

3.8396 

A2 

25,  100,  10 

52451256 

4.26342E+12 

-0.48378 

1.5860 

1.5260 

A3 

5,  20,  10 

2002 

10015005 

-0.48376 

0.8887 

0.3052 

Figure  2.  With  n  fixed,  the  slope  remains  constant  and  the  intercept  increases  with / 

and  g. 

In  Figure  2  we  see  the  gap  between  the  upper  and  lower  bounds  plotted  against 
the  number  of  iterations  of  fictitious  play,  on  log-log  scale.  The  slope  of  the  fitted  line 
(from  the  column  labeled  “slope”  in  the  table  above  the  plot)  indicates  the  rate  of 
convergence.  Note  that  the  slope  is  constant  as/ and  g  increase,  and  n  remains  fixed. 
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Case 

f  g,  n 

#  pure  strategies 

Slope 

(a) 

Intercept 

(b) 

Final  Gap 
(UB-LB) 

Standard 

Error 

Attacker 

Defender 

B1 

30,  50,  30 

5.91E+16 

3.33E+21 

-0.517 

1.8541 

2.0374 

0.0094 

B2 

30,  50,  20 

1.89E+13 

4.63E+16 

-0.543 

1.8954 

1.7824 

0.0157 

B3 

30,  50,  15 

1.15E+11 

4.79E+13 

-0.580 

1.9463 

1.5238 

0.0187 

B4 

30,  50,  10 

2.12E+08 

1.26E+10 

-0.588 

1.8839 

1.1999 

0.0270 

B5 

30,50,  5 

46376 

316251 

-0.851 

2.1746 

0.4787 

0.0469 

B6 

30,50,  2 

31 

51 

-1.093 

1.4431 

0.0216 

0.0194 

a=-0.543  (B2) 
a=-0.580  (B3) 
a=-0.588  (B4) 


1.5  2 

log(  #  Iteration) 

Figure  3.  The  best-fit  slope  increases  with  n  if  and  g  fixed). 


•  B2:  n=20 

—  Predicted  B2 
+  B3:  n=15 

—  Predicted  B3 
B4  n=10 

—  Predicted  B4 


If  we  increase  n  for  a  fixed  f  and  g,  we  see  in  Figure  3  that  the  slope  increases, 
and  appears  to  approach  a  limit  of  -0.5  (Figure  4).  This  is  consistent  with  conjectures  of 


convergence  of  FP. 
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Figure  4.  Convergence  of  asymptotic  slope 

2.  Elapsed  Time  per  FP  Iteration 

We  observe  that  for  all  tested  values  of  f  g  and  n,  the  elapsed  time  per  FP 
iteration  is  constant  as  the  number  of  FP  iterations  k  increases.  This  is  illustrated  in 
Figure  5  and  occurs  because  the  amount  of  FP  data  required  to  be  manipulated  and  stored 
does  not  increase  with  k. 
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Iteration 

1000 

2000 

3000 

4000 

5000 

6000 

7000 

8000 

9000 

10000 

f  g,  n 

Elapsed  time  (sec) 

5,  6,  10 

2.4 

4.6 

6.8 

9.0 

11.2 

13.3 

15.5 

17.7 

19.9 

22.2 

20,  25,  30 

24.3 

48.4 

72.6 

96.7 

121.4 

145.5 

169.1 

193.8 

217.6 

242.0 

40,  45,  50 

81.7 

163.4 

245.2 

326.4 

409.2 

490.4 

572.2 

653.1 

736.6 

817.4 

Figure  5.  FP  iterations  vs.  Elapsed  time  for  3  games 
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3.  Comparisons  with  FP  and  ILP  Procedure 

Comparisons  are  made  between  the  ILP  and  FP  solution  procedures.  Three 
different  payoff  functions  are  examined. 

a.  Convex  Payoff  Function 

The  convex  payoff  function  is  given  by 


A(xt, yt )  =  max (x;.  -  y. , 0) . 
Figure  6  shows  how  the  two  procedures  performed. 


FP  (k  = 

2000) 

ILP 

Run 

f  g>  n 

Elapsed 

time 

Upper 

Lower 

Gap 

Elapsed 

time 

Value  of 
game 

1 

3, 

4, 

5 

1.962 

2.208 

2.179 

0.029 

0.10 

2.2 

2 

6, 

8, 

5 

2.774 

4.415 

4.358 

0.057 

0.18 

4.4 

3 

12, 

16, 

5 

4.657 

8.831 

8.717 

0.114 

0.22 

0° 

oo 

4 

15, 

20, 

5 

5.668 

11.039 

10.896 

0.143 

0.19 

11 

5 

30, 

40, 

5 

11.147 

22.077 

21.792 

0.285 

0.25 

22 

6 

60, 

OO 

o 

5 

24.535 

44.154 

43.584 

0.570 

0.12 

44 

7 

120, 

160, 

5 

59.766 

88.308 

87.168 

1.140 

0.21 

88 

8 

150,  200, 

5 

83.801 

110.385 

108.960 

1.426 

0.12 

111 

1.6 

1.4 

1.2 

1.0 

0.8  ro 
O 

0.6 
0.4 
0.2 
0.0 

1  2  3  4  5  6  7  8 

Run 

Figure  6.  Comparison  of  FP  and  ILP  procedures  with  convex  payoff  function 
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b.  Capacitated  Payoff  Function 
The  capacitated  payoff  function  is 


A(xt>yt) 


j max (x,  - y. , 0),  x,  - yi  <  cap 
[  cap,  xt  -  yt  >  cap  ’ 


where  cap  is  the  maximum  possible  payoff.  We  note  that  the  ILP  objective  function  need 
not  be  either  convex  or  concave.  Figure  7  shows  how  the  two  procedures  perfonned. 


Run 

fg>  n 

FP  (k  =  2000) 

ILP 

Elapsed 

time 

Upper 

Lower 

Gap 

Elapsed 

time 

Upper 

Lower 

Gap 

1 

3,  4,5 

1.743 

2.208 

2.179 

0.029 

0.180 

2.200 

2.200 

0.000 

2 

6,  8,5 

2.704 

2.958 

2.872 

0.086 

0.100 

3.120 

2.800 

0.320 

3 

12,  16,5 

4.697 

3.935 

3.767 

0.168 

0.100 

4.114 

3.800 

0.314 

4 

15,  20,5 

5.748 

4.252 

4.078 

0.174 

0.170 

4.333 

4.000 

0.333 

5 

30,  40,5 

11.226 

4.884 

4.619 

0.265 

0.190 

5.143 

4.200 

0.943 

6 

60,  80,5 

24.536 

5.350 

4.960 

0.389 

1.420 

5.807 

4.200 

1.607 

7 

120,  160,  5 

59.946 

5.622 

5.130 

0.492 

62.380 

6.250 

4.200 

2.050 

8 

150,  200,5 

82.278 

5.713 

5.144 

0.570 

102.170 

6.338 

4.091 

2.247 

Figure  7.  Comparison  of  FP  of  ILP  procedures  with  the  capacitated  payoff  function 
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c.  Binary  Payoff  Function 

The  binary  payoff  function  is 


A(xi>yt) 


jo,  Xi-yt<  0 
jl,  xi-yi>  0 


As  with  the  capacitated  payoff  function,  the  ILP  in  this  case  need  not  be  either  convex  or 
concave.  Figure  8  shows  how  the  two  procedures  performed. 


Run 

f  g>  n 

FP  (k  =  2000) 

ILP 

Elapsed 

time 

Upper 

Lower 

Gap 

Elapsed 

time 

Upper 

Lower 

Gap 

1 

3,  4,5 

1.021 

1.216 

1.179 

0.037 

0.12 

1.200 

1.200 

0.000 

2 

6,  8,5 

2.805 

1.477 

1.407 

0.070 

0.17 

1.600 

1.400 

0.200 

3 

12,  16,5 

4.757 

1.675 

1.574 

0.101 

0.19 

1.920 

1.400 

0.520 

4 

15,  20,  5 

5.668 

1.721 

1.618 

0.103 

0.11 

2.000 

1.333 

0.667 

5 

30,  40,  5 

11.186 

1.850 

1.694 

0.156 

0.30 

2.000 

1.400 

0.600 

6 

60,  80,5 

24.756 

1.921 

1.716 

0.205 

2.57 

2.167 

1.400 

0.767 

7 

120,  160,  5 

60.758 

1.990 

1.703 

0.287 

61.96 

2.182 

1.400 

0.782 

8 

150,  200,5 

83.260 

2.012 

1.686 

0.326 

149.89 

2.214 

1.400 

0.814 

0.9 

0.8 

0.7 

0.6 

0.5  a 

CD 

0.4  CO 

0.3 

0.2 

0.1 

0.0 


Figure  8. 


Comparison  of  FP  and  ILP  procedures  with  the  binary  payoff  function 
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C.  CONCLUSIONS 

As  has  been  observed  in  earlier  FP  studies  the  FP  gap  (the  difference  between  the 
upper  and  lower  bounds  or  game  value)  as  a  function  of  number  of  FP  iterations,  k,  is 
approximately 

gap(k)  *  ^  , 

for  large  enough  k. 

The  best-fit  a  increases  with /  and  g,  and  the  best-lit  b  decreases  to  approximately 
1/2  with  increasing  n. 

Because  of  efficiencies  realized  in  the  DP  procedure  used  to  solve  the  FP 
subproblems,  the  computation  time  required  for  each  FP  iteration  is  approximately 
constant  as  the  number  of  FP  iterations  increases,  for  fixed  f  g  and  n. 

For  the  convex  payoff  function  tested,  the  ILP  formulation  solved  with  GAMS 
was  faster  and  more  accurate  than  the  FP  procedure. 

For  the  non-convex  payoff  functions  tested,  the  FP  procedure  was  more 
competitive  and  sometimes  significantly  outperformed  than  ILP  procedure. 
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IV.  CONCLUSIONS  AND  FURTHER  STUDY 


We  propose  a  new  efficient  fictitious  play  (FP)  procedure  to  solve  two-person 
zero-sum  Blotto  games.  The  algorithm  uses  dynamic  programming  (DP)  to  solve  the  FP 
subproblems  at  each  iteration.  By  representing  intermediate  mixed  strategies  through 
marginal  distributions  are  keep  the  state  space  of  the  DP  manageable  and  independent  of 
the  number  of  iterations.  Although  our  experiments  considered  one  type  of  attacker  and 
one  type  of  defender,  we  indicate  how  to  generalize  this  procedure  to  cases  with  more 
than  one  type  of  attacker  or  defender  (or  both). 

During  this  study,  we  identified  other  topics  for  further  investigations.  The  first  is 
to  investigate  generalizations  of  Blotto  games  in  which  defenders  can  be  placed  in  such  a 
way  as  to  defend  multiple  areas  at  once.  This  is  closer  to  the  real  situation  with  missile 
defense.  The  second  is  to  explore  the  issue  of  playability  in  the  ILP  formulations.  Our 
proposed  playability  constraint  is  currently  too  restrictive;  we  have  provided  examples  in 
which  the  optimal  solution  to  the  ILP,  with  the  playability  constraint,  is  not  equal  to  the 
value  of  the  game.  Further  research  should  explore  less  restrictive,  alternate  formulations 
of  playability  constraints.  It  is  possible  (although  unlikely)  that  less  restrictive  playability 
constraints  would  also  yield  more  efficiently  solvable  ILP,  making  that  approach 
competitive  with  the  DP-based  procedure  for  larger  problems. 
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APPENDIX  A.  MATLAB  CODE  FOR  NEW  FP  PROCEDURE 


1.  Blottofp.m 


%  Blotto  Game  Fictitious  Play  Solution  with  general  payoff  matrix 

%  n  =  #  areas  to  be  attacked  and  defended 
%  f  =  #  attackers  to  be  allocated  to  the  n  areas 
%  g  =  #  defenders  to  be  allocated  to  the  n  areas 

%  Initialize  with  a  vector  of  marginals  for  the  attacker  and 
%  defender. 

%  initialize  A  for  attacker  (each  row  is  frequency  of 
%  0,1,2.  ...  ,f  being  assigned  to  area  i=l,...,n), 

%  initialize  D  for  defender  (each  row  is  frequency  of 
%  0,1,2,  ...  ,g  being  assigned  to  area  i=l,...,n), 

%  initalize  upper  and  lower  bounds  on  the  value  of  the  game,  and 
%  number  of  iterations  desired. 

%  Calculate  P(a,d)  =  return  if  a  attackers  and  d  defenders 
%  attack  any  area 

clear; 

f  =  150; 
g  =  200; 
n  =  5  ; 

num  its  =  2000; 

for  i  =  1 : f +1 

for  j  =  1 : g+1 

P (i,  j ) =linearpayof f (i, j ) ; 

%  P ( i , j ) =cappedpayof f ( i , j ) ; 

%  P ( i ,  j ) =binarypayof f ( i , j ) ; 

end 

end 

%  Set  initial  A  for  all  attacks  in  area  1 
A  ( 1 , : )  =  [zeros (1, f) , 1] ; 
for  k=2 : n 

A ( k,  : )  =  [1, zeros (1, f) ] ; 

end 

%  Set  initial  D  to  all  defenses  in  area  1 
D  ( 1 , : )  =  [zeros (l,g) ,1] ; 
for  k=2 : n 

D  (k,  : )  =  [1,  zeros  (1,  g)  ]  ; 

end 

v  up  =  f;  %  assumes  all  missiles  are  leakers 
v  low  =  0;  %  assumes  no  missiles  are  leakers 

%  preallocates  v  up  and  v  low  in  memory 
v  up=v  up* ones ( 1 , num  its+1); 


%  number  of  attackers 
%  number  of  defenders 

%  number  of  areas  to  attack  and  defend 
%  number  of  FP  iterations 
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v  low=v  low*ones ( 1 , num  its+1); 
for  k=l:num  its 

[opt_atk, v_u]  =  attacker (D, P, n, f, g) ; 

%  Find  best  pure  attack  for  defenses  seen  so  far 
%  (and  upper  bound) . 

[opt_def , v_l]  =  defender (A, P, n, f, g) ; 

%  Find  best  pure  defense  for  attacks  seen  so  far 
%  (and  lower  bound) . 

A  =  update  attack  marginals (A,  opt  atk)  ; 

%Update  attack  and  defense  marginals. 

D  =  update  defense  marginals (D, opt  def ) ; 
v  up(k+l)  =  min (v  u,v  up(k)); 
v  low(k+l)=  max (v  l,v  low(k)); 

if  (k/100  ==  floor (k/100) )  ,  home,  k,  gap= (v_up ( k+1 ) - 

v  low ( k+1 ) )  ,  end 
end 

figure ( 1 ) 

loglog ( [ 0 : num_its ] , (v_up  -  v_low) ,  ' ro ' ) , grid  on,  title  ('  (upper 
bound  -  lower  bound)  vs.  #FP  iterations') 
figure (2 ) 

plot ( [ 0 : num_its ] , v_up, ' go ' , [ 0 : num_its ] , v_low, ' rx ' ) , grid  on, 

axis([0  num  its  0  f ]), title (' Upper  and  lower  bounds  vs.  #FP 
iterations ' ) 

bounds  =  [v_up (end) , v_low (end) ] 
meanA  =  mean (A) /num  its 
meanD  =  mean (D) /num  its 
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2. 


Attacker.m 


function  [opt_attack, v_up]  =  attacker (D, P, n, f, g) 

%  Takes  defenses  D  and  returns  optimal  attack  column  vector  and 
%  an  upper  bound  on  the  value  of  the  Blotto  game. 

%  n  areas,  f  attackers,  g  defenders 
%  Uses  general  payoff  function  P(a,d) 

%  #defenses  (increases  with  FP  iterations) 

num^defenses  =  sum(D(l,:)); 

s=D*P';  %  compute  s(i,j)  =  exp.  1-step  payoff 

%  when  j  attackers  assigned  to  area  i. 

%  Uses  the  marginal  matrix 

%  D(i,j)  =  #  times  j  defenders  assigned  to  area  i. 

v  star  =  zeros (n, f+1 ) ;  %initialize  v  star(i,j)  =  optimal 
%  return  when  j  attackers  are  available  for  i  areas 
a_star  =  zeros (n, f+1) ;  %initialize  a  star(i,j)  =  optimal 
%  #  attackers  to  use  when  j  attackers  are  available  for  i  areas 

%  n  =  1 

v  star(l,:)  =  s  ( 1 ,  : ) ;  %  optimal  payoff  when  1  area  is  included 
a_star(l,:)  =  [ 0 : f ] ;  %  optimal  #attackers  when  1  area  is  included 

%  n  >  1 

for  i=2:n  %  areas 

for  j=0:f  %  j  attackers  remain  to  be  used 

v  starnew  =  zeros  (1, j+1) ;  %initialize  a_starnew 
for  k=0 : j 

v  starnew(k+l)  =  s(i,k+l)  +  v  star (i-1 ,  j -k+1 )  ; 

%  enumerating  possible  returns 
end 

[v_star ( i , j +1 ) , a_star ( i , j +1 ) ]  =  max (v_starnew) ; 

%  identifying  the  #  attackers  giving  a  max.  payoff 
a_star (i, j +1 )  =  a_star (i, j+1) -1; 

%  correcting  for  col.  1  being  for  0  attackers 
end 

end 

v  up  =  v  star (n, f+1) /num  defenses; 

opt  attack  =  zeros (n, 1) ;  %  initialize  opt  attack 

opt_attack (n)  =  a_star (n, f+1 ) ; 

%  establish  optmal  #attackers  to  use  for  n  areas 

attackers  remaining  =  f  -  opt  attack (n); 

%  update  #attackers  remaining  for  remaining  n-1  areas 

for  i=n-l : -1 : 1 

opt_attack ( i )  =  a_star ( i , attackers_remaining+l ) ; 
attackers  remaining  =  attackers  remaining  -  opt  attack (i); 

end 
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3. 


Defender.m 


function  [opt  defense, v  low]  =  defender (A, P, n, f, g) 

%  Takes  attacks  A  and  returns  optimal  defense  column  vector  and 
%  an  lower  bound  on  the  value  of  the  Blotto  game. 

%  n  areas,  f  attackers,  g  defenders 
%  Uses  general  payoff  function  P(a,d) 

num^attacks  =  sum(A(l,:)); 

q=A*P;  %  compute  q(i,j)  =  exp.  1-step  payoff 

%  when  j  attackers  assigned  to  area  i. 

%  Uses  the  marginal  matrix 
%  A(i,j)  =  #  times  j  attackers  assigned  to  area  i. 

r  star  =  zeros (n, g+1 ) ;  %initialize  r  star  and  d  star 
d_star  =  zeros (n, g+1 ) ; 

%  n  =  1 

r_star (1,  : )  =  q  (1,  :  )  ; 
d_star(l,:)  =  [ 0 : g] ; 

%  n  >  1 
for  i=2 : n 

for  j=0:g 

r  starnew  =  zeros  (1, j+1)  ;  %initialize  r  starnew 
for  k=0 : j 

r  starnew(k+l)  =  q(i,k+l)  +  r  star (i-1 , j -k+1 ) ; 
%  enumerating  possible  #  defenders 
end 

[r_star (i, j+1) , d_star (i, j+1) ]  =  min (r_starnew) ; 

%  identifying  the  #  defenders  giving  a  min.  payoff 
d_star (i, j +1 )  =  d_star (i, j+1) -1; 

%  correcting  for  col.  1  being  for  0  defenders 
end 

end 

v  low  =  r  star (n, g+1 ) /num  attacks; 
opt_defense  =  zeros (n, 1) ; 
opt_defense (n)  =  d_star (n, g+1 ) ; 
defenders  remaining  =  g  -  opt  defense (n); 
for  i=n-l : -1 : 1 

opt  defense (i)  =  d  star ( i , defenders  remaining+1 ) ; 
defenders  remaining  =  defenders  remaining  -  opt  defense (i) 

end 
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4.  Updateattackmarginals.m 

function  [A  new]  =  update  attack  marginals (A, opt  atk) 
[n,  m] =size  (A)  ; 

A  new=A; 
for  k=l : n 

z=opt_atk ( k) ; 

A  new(k,z+l)=A  new ( k, z+1 ) +1 ; 

End 


5.  Updatedefensemarginals.m 

function  [D  new]  =  update  defense  marginals (D, opt  def) 
[n, m] =size (D) ; 

D  new=D; 
for  k=l : n 

z=opt_def  ( k)  ; 

D  new(k,z+l)=D  new ( k, z+1 ) +1 ; 

end 


6.  Payoff.m 


Unconstrained  target  case: 

Linearpayof f .  m 

function  P=linearpayof f (a, d) 
if  a>d 

P=a-d; 

else 

P=0 ; 

end 

Capacitated  target  case: 

Cappedpayof f .  m 

function  P  =  cappedpayof f (a, d, cap) 
P=max  (0,  min  (a-d,  cap)  )  ; 


Binary  target  case: 

Binarypayof f .  m 

function  P  =  binarypayof f (a, d) 
P= (a>d) ; 
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APPENDIX  B.  GAMS  CODE  FOR  ILP  PROCEDURE 


*  Blotto  Attack  &  Defense  game 

*  Attacker  is  the  row  player  and  maximizer. 

*  Defender  is  the  column  player  and  minimizer. 

*  Uses  integer  restrictions  to  require  playability 


$off listing 
$inlinecom  {  } 

OPTIONS 


SOLPRINT  = 

OFF, 

DECIMALS  = 

2, 

LIMCOL 

0, 

LIMROW 

0, 

RESLIM 

86400, 

ITERLIM  = 

100000, 

OPTCA 

\ — 1 
O 

o 

OPTCR 

o 

o 

o 

lp 

xa. 

MIP 

xa;  {XA 

{MAX  SECONDS} 

{MAX  PIVOTS} 

{ABSOLUTE  INTEGRALITY  TOLERANCE} 
{RELATIVE  INTEGRALITY  TOLERANCE} 

Solver } 


SCALARS 


N 

total 

number 

of 

areas 

/  5 

/ 

F 

total 

number 

of 

attackers 

/15 

/ 

G 

total 

number 

of 

defenders 

/20 

/ 

Time 

execution  time 

/o 

/ 

SETS 

i  #  attackers  used  /0attack*15attack/ 

j  #  defenders  used  /0defend*20defend/ 


PARAMETER  P(i,j)  damage  done  when  i  attack  and  j  defend; 

LOOP  ((i,j),  P ( i , j ) =max ( (ord ( i ) -ord ( j ) ) , 0 ) ) ;  {convex  payoff} 

*LOOP  ((i,j),  P (i, j ) =min (P (i, j ) , 3) ) ;  {capacitated  payoff} 

*LOOP  ( (i, j ) ,  if  (P ( i , j ) >0 , 

P(i, j)=i; 

else 

P(i, j ) =0; ) 

) ;  {binary  payoff} 


VARIABLES 

vl  value  of  game  for  defenders 

v2  value  of  game  for  attackers 

x(i)  Attacker  marginal  distribution  on  #  attackers  used 

y(j)  Defender  marginal  distribution  on  #  defenders  used 

xcount(i)  Attacker  marginal  dist.  *  N 
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{=  #  times  i  attackers  used} 
ycount(j)  Defender  marginal  dist.  *  N 
{=  #  times  j  defenders  used} 
a  x-intercept 

b  slope 

c  y-intercept 

d  slope; 

POSITIVE  VARIABLES  x,  y,  b,  d  ; 


INTEGER  VARIABLES  xcount,  ycount  ; 

* POSITIVE  VARIABLES  xcount,  ycount  ; 


EQUATIONS 

ob j  ectivel 
expreturn^y ( i ) 
meande fenders 
probsum_y 

*  extraconstraintO 


objective  funtion 
expected  return 

constraint  on  mean  number  of  defenders 
sum  of  probabilities  is  1 
extra  contraits  to  help  explore 


different  optimal  solutions 
ycountdef(j)  definition  of  ycount 


obj ectivel..  vl  =e=  N*c  +  d*F  ; 

expreturn_y ( i ) . .  sum(j,  P(i,j)*y(j))  =1=  c+d* (ord (i) -1) ; 

meandefenders . .  N*sum ( j , y ( j ) * (ord ( j ) -1 ) )  =1=  G  ; 

probsum_y. .  sum(j,y(j))  =e=  1  ; 

*extraconstraintO . .  y ( ' lOdefend ' )  =e=  .25  ; 

ycountdef ( j ) . .  ycount (j)  =e=  N*y(j)  ; 

MODEL  Defense  /objectivel, 
expreturn_y, 
meandefenders , 
probsum_y 
ycountdef 

/; 

EQUATIONS 

obj  ective2 
expreturn_x ( j ) 
meanatt ackers 
probsum  x 
xcountdef ( i ) 


objective2  ..  v2  =e=  N*a  -  b*G  ; 

expreturn_x ( j )  ..  sum(i,  x(i)*P(i,j))  =g=  a-b* (ord ( j ) -1) ; 

meanattackers  ..  N*sum (i, x (i) * (ord (i) -1 ) )  =1=  F  ; 


objective  funtion 
expected  return 

constraint  on  mean  number  of  attackers 
sum  of  probabilities  is  1 
definition  of  xcount 
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probsum  x  ..  sum(i,x(i))  =e=  1  ; 

xcountdef ( i ) . .  xcount(i)  =e=  N*x(i) 

MODEL  Attack  /objective2, 
expreturn  x, 
meanatt ackers , 
probsum  x 
xcountdef 

/; 


SOLVE  Defense  using  mip  minimizing  vl; 

SOLVE  Attack  using  mip  maximizing  v 2; 

Time=Time+Defense . resusd+Attack . resusd;  {computation  time} 

DISPLAY  vl.l,  v2 . 1 ,  y.l,  x.l; 

DISPLAY  ycount . 1 , xcount . 1 ,  Time; 
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